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Abstract 



In this paper we prove two results. First we show that dynamical systems with an a-mixing 
have in the limit Poisson distributed return times almost everywhere. We use the Chen-Stein 
method to also obtain rates of convergence. Our theorem improves on previous results by allowing 
for infinite partitions and dropping the requirement that the invariant measure have finite entropy 
with respect to the given partition. As has been shown elsewhere, the limiting distribution at 
periodic points is not Poissonian (but compound Poissonian), therefore our result applies to 
cylinder neighbourhoods that don't exhibit local periodic behaviour. In the second part we prove 
that Lai-Sang Young's Markov Towers have Poisson distributed return times if the correlations 
decay for observables that are Holder continous and L°° bounded. 

1 Introduction 

One of the main motivations in studying deterministic dynamical systems is to show that they 
typically behave in a statistically regular way and exhibit on large timescales independence. In 
this paper we follow in the tradition and show that for invariant measures that are a-mixing (in 
the dynamic's sense) with respect to a possibly countably infinite partition the return times are 
in the limit Poisson distributed. 

Interest in such questions go back to the 1940's when Doeblin [8] studied the Gauss map and 
its invariant measure. Later, in the 1970s Harris studied return times for Markov processes and 
then around 1990 the interest of the return times statistics became a central topic in dynamics. 
Using symbolic dynamics, Pitskel [19] proved for Axiom A maps the return times are in the 
limit Poisson distributed with respect to equilibrium states for Holder continuous potentials. 
Hirata [13] has a similar result using the Laplace transform which he then generalised later 
in p3]. Galves and Schmitt [TO] then came up with a technique to get result for the first entry 
or return time which they applied to ^-mixing systems and where they also for the first time 
provided error estimates. This method was then greatly extended by Abadi [2], [31 0] to a-mixing 
systems. Using a combinatorial argument it is then shown in [5] that a-mixing processes have in 
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the limit Poisson distributed return times. Typically when entry times are Poisson distributed 
then so are the return times. In fact, for arbitrary return or entry times distribution there is a 
formula [II] that allows to translate one into the other one. 

In this paper we consider maps that are a-mixing with respect to an invariant measure and 
a partition which can be finite or countably infinite. In fact, in the infinite case we don't require 
the entropy to be finite. We show that the return times to cylinder sets are in the limit Poisson 
distributed using the method of Chen-Stein, which requires us to only look at two fold mixing 
sets. We also obtain rates of convergence. Naturally we have to keep away from return sets that 
'look' periodic. Clearly at periodic points the limiting distribution cannot be Poisson but is, as 
was shown in [12] compound Poisson distributed. We then also look at Young towers and show 
that they are a-mixing which then implies Poisson distributed return times. 

Let us note that it is crucial to select the return set to be some 'regular' set like cylinders 
as Kupsa and Lacroix [16\ [T7] have shown that any limiting distribution can be realised if one 
choses the return sets appropriately. 



2 Distribution for a-mixing systems 

Let T be a map on ft and \x a T-invariant probability measure on ft. Let A be a finite or countably 
infinite measurable partition on ft. We put A n for its nth join Vj=o T~i A. We assume that the 
partition A is generating (i.e. the atoms of A°° consist of single points). 

Throughout the paper we will assume that is (right) a-mixing, that is there exists a de- 
creasing sequence a(k) — > (as k — > oo) so that 



M (Anr— * (*)) 



KB) 



< a{k) 



for all A G A n , B G cflj^^ ) an< ^ f° r au n ' ^ ( see e -S- El)- Let us note that there exists a > 
so that for any n G N and A G A n one has n(A) < e~ an . For a proof of this fact see Abadi [2 J 
whose proof for finite alphabets carries over to infinite alphabets without any change. 

For a set A C ft the hitting time ta '■ ft —> N U {oo} is a random variable defined on the entire 
set ft as follows 

T A (x) =inf |jfe > 1: T k {x) G a} 

(t a (x) = oo if T k x A Vfc G N). If we narrow down the domain of ta to the set A then ta 
is called the return time or first-return time. According to Kac's theorem [15] we have that 
J A t aA[ 1 = 1 f° r I 1 an ergodic T-invariant probability measure and measurable A C ft with 
positive measure. Similarly we can then define the k th return time t\ by putting t\ (k = 1) and 
for k > 1 recursively 

T k A (x) = inf [l > T k A \x): T l {x) G a} = ^(T^^x)). 

Let us also define the recurrence time of A C ft, under the map T, to be 

r A = inf{n G N\A n T~ n (A) + 0}. 

From the mixing property we conclude that < min{^ : ot(l) < 1}. 
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Remark: In a similar way one can define a measure /i to be left a-mixing if 

n(AnT- n - k {B)) 



KB) 



< a{k) 



for all A G A n , B G &([JjA?) and n, k. The results in this paper on the distribution of return 
times (Theorems 1,2 and 3 and also in particular the results on short return times) also apply to 
left a-mixing systems since the techniques involved are symmetric. 
For A G a{A n ) (union of n-cylinders) let us define 



S A (k) 



mm (fi(A^) + a(k - w) 

Kw<k 



where A^ G a(A w ) is the smallest union of ^-cylinders that contains A. Put a(n) = Y^^Ln a U)- 



Theorem 1. Let u be a T -invariant probability measure which is a-mixing with respect to a 
generating and at most countably infinite partition A. Assume that the sequence a(n) is summable 
(i.e. a{n) — > as n — )• oo). Then there exists a constant C\ so that 



T A > 



n(A) 



k-l 
i=l 



< ( ',/('Vl)hf ^Afi(A) + n(8 A (n) + 5 A (r A )) + a(n) + \ log fi(A)\. 



for all k, n G N and A G a(A n ). 

Theorem 2. Lei /i be an a-mixing T -invariant probability measure with the at most countable 
infinite partition A. Let K > and rj > 1 be so that n J? a(n) — > as n — > oo and | log /i(.A)| < i^n 1 ? 
/or some n > 1 ana" > §, where A G c^™) is a finite or infinite union of n-cylinders and t\ 
its k th return time. Then 

(i) Exponential mixing rate: Suppose a[n) = 0(d n ), with < 1? < 1. Then there exists 
7 = 7(1?) > and C2 > such that 



/i(A) 



fc-i 



i=l 



Vi > and Vn G N. 



(1) 



fzzj Polynomial mixing rate: Suppose a(n 
C2 > sitc/i i/iai 



< c 2 t(t v i)e^ n 

0(n _/3 ) with (3 > 1 + n, f/ien i/iere exists 



fc-i 

E< 

i=i 



< c 2 t(t v 1) 
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Vt > and Vn G N. 



Remarks: 



(I) The assumption of Theorem[T]that the recurrence time r A be greater than ^ can be substituted 
with any other number of the order of n. This assumption is in place to ensure that the reference 
cylinder A does not exhibit a periodic behavior. By its very definition, the set A consists of 
points that travel together for at least n iterates of the map F. In view of this property if the 
set A revisited itself too early on by the means of a single point x that would have caused an 
entire neighborhood of A to fall into A at that same iterate. Considering the extreme case, if 
the entire set falls into A at the same iterate of F that renders A periodic. In this case the set 
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A would act like a "trap" . By asking that more time passes by before any of ^4's points comes 
back to A we ensure that the system is nearer to the time where the set will start spreading 
all over the space, by virtue of the mixing properties that govern the dynamics. In particular 
for cylinders around periodic points the limiting distribution of return times is a compound 
Poissonian distribution |12|. 



(II) Commenting on the assumption that | log [i(A n )\ < Kn 71 recall that if H(A) < oo then by the 
theorem of Shannon-MacMillan-Breiman |18j . for a.e. point if!], for the family of n-cylinders 
{A n (x)} nG fq centered at x, there exists C > such that 

\log n(A n (x))\ < Cn VnGN, (3) 

i.e. 7] = 1. On the other hand, if H(A) < oo and r] > 1 then we can give a rough estimate on the set 
of cylinders that don't satisfy the condition | log /^(^4)| < Kn v . Denote by B(n) C A n the set of all 
the n-cylinders A that satisfy | \ogn(A)\ > Kn n . Then, since H(A n ) = Y^AeA n l°g mO^)I - 
nH(A), we obtain 

nH{A)> V ii{A n )\\ogii{A n )\ > V Kn\(A n ) = Kn r > fi(B(n)) 



A n £B(n) A„eB(n) 



which implies 



M(B(n)) < ^ < 



Kn'n- 1 ~ nT 1 ' 

This shows that for rj > 1 as n increases the exception set, or "bad" set, gets smaller. The bigger 
the r] we choose the bigger coverage we achieve, where the estimates hold, but making r/ larger 
that has a direct effect on the error estimates. 

In the special case when A is an n-cylinder, we expect the first return time to be exponentially 
distributed. Indeed there are several results on this topic, notably by Galves and Schmitt [10] for 
^-mixing measures and Abadi [3] for a-mixing measures. Both of those results (as well as others) 
however assume the partition to be finite and generating. Here we obtain the exponential limiting 
distribution for the first return time for a-mixing measures and infinite (generating) partitions. 
The following corollary follows from Theorem [2] since P(ta > t) = P(VK[t] = 0) (see below). 

Corollary 1. Let [i be an a-mixing w.r.t. the partition A that is at most countably infinite. Let 
K > 0. Then for all A € A n which satisfy |logu(A)| < Kn v for some r] > 1 and rA > § the 
following hold true: 

(i) Exponential mixing rate: Suppose a(n) = 0($ n ), with < $ < 1, then there exists 
7 = 7(1?) > and C3 > such that 



(t a >t)- e-^)* 



< C 3 t(t V l)e" 7n Vi > 0. (4) 



(ii) Polynomial mixing rate: Suppose a(n) = 0(n with (3 > 1 + j], then there exists 
C3 > such that 



' (t a >t)-e 



-ti(A)t 



< C 3 t(iVl)- 



n 



P-l-ri 



Vt > 0. 



(5) 
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3 Proof of Theorem [T] 
3.1 Short returns 

Abadi has shown that for a-mixing systems the measure of cylinder sets decay exponentially, 
i.e. there are strictly positive constants K and A such that fi(A) < Ke~^ n for any integer 
n G N and any n-cylinder A. Recall that 5A(k) = mmi< w< k (^fi(A^) + a(k — w)) where A^ w ' G 
a (T-( n -™)A w ) is smallest so that A C A^ w \ 

Lemma 1. Let (£1, J-, /x, T) be an a-mixing, measure-preserving dynamical system. There exists 
a positive constant C such that for all n G N and all A G a(A n ): 

fi(A n T' k A) < n(A)6 A (k), Vk G N. 
Proof. Let w G (0, k) and A^ G a(T-^ n - w ^A w ) smallest be so that A C A^ . Then 

li{A n T~ k A) < n (a.W n T~ k A^j 
and the a-mixing property yields 

(j, (An T- k A^ < fj,(A) (ji(A^) + a(k - w) 

The lemma follows since w G [1, k) is arbitrary. I 

Lemma 2. Lei j4 G a{A n ), \i an a-mixing probability measure. Then 

F a (t a o T n ~ l < M) < n5 A (n) + Mfi(A) + a(n) 
(if M < n then the last two terms on the RHS are put equal to 0). 
Proof. For w G [l,n] we choose A^ G <r(T-( n-1 '0.A w ) so that A C A^. Then 

n-l M 
ti{An{r A oT n - 1 <M}) = ^ ii (An{r A oT"- 1 =j})+^/i(yln{r yl or"- 1 =i}) 

j=l j=n 
n-l M 

n-l M 

< £ + a(n + j -«;)) + £ m(^)(m(^) + «(j)) 

which proves the lemma since ^2jL n &(j) — ^( n )- ^ 
Recall that the recurrence time r A to the set A is defined as the smallest j for which A n 

Lemma 3. P a (t a < t) < 2n(5 A (n) + 5 A {r A )) + tfi(A) + a{n) 
Proof. By Lemmata [J and [2] we have for t > n: 

^a{t A < t) < F a (t a o T n - 1 <t-n)+ F a (t a < n) 

< n5 A (n) + tfi(A) + a(n) + n5 A (mm(r A , n)) 

< 2n5 A {n) +tfi(A) + a{n) +n5 A {r A ) 

and F a (t a < t) < tS A (r A ) < n5 A (r A ) if t < n. I 
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3.2 The Stein method 

Let fj, be a probability measure on No which is equipped with the power u-algebra £>n . Addi- 
tionally we denote by /io the Poisson-distribution measure with mean t, i.e. P Mo ({fe}) = e fc j* 
V/c £ No Also let J 7 to be the set of all real- valued functions on No- The Stein operator S : T —> T 
is defined by 

Sf(k)=tf(k + l)-kf(k), VfcGNo (6) 

The Stein equation 



Sf = h- / hdfi (7) 

M 

for the Stein operator in © , has a solution / for each /io-integrable h € T (see [6] ) . The solution 
/ is unique except for /(0), which can be chosen arbitrarily. Moreover / can be computed 
recursively from the Steinequation, namely [6]: 

/(*) = *^^X>(9 (8) 

i=Q 

= -^^E(^)-mW)|, VfceN (9) 

i=fc 

In particular, if h : No — > M. is bounded then so is the associated Stein solution /. 

Proposition 1. [6j A probability measure fi on (No,Sn ) is Poisson (with parameter t) if and 
only if 

Sf dfi = for all bounded functions f : No 

No 

A probability measure \x on (No,£>n ) which approximates the Poisson distribution fi$ can be 
estimated as follows: 



/ 



\l*(E) - IM){E)\ 



Sf dfi 

No 



(tf(k + l)-kf(k))dn 

No 



(10) 



where E C No and / is the Stein solution that corresponds to the indicator function xe- Sharp 
bounds for the quantity on the right-hand side of ([TP]) is what one is after when the Stein method 
is used for Poisson approximation. 

Lemma 4. For the Poisson distribution [xq, the Stein solution of the Stein equation TO that 
corresponds to the indicator function h = \E, with E C No, satisfies 



In particular 



E'" , „ \ m if m < t , . 

t + (2 + t) log f ifm>t. 
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Proof. We consider the two cases (i) k > t and (ii) k <t. 

(i) k > t: For h = xe, from the representation ([9]) for the Stein solution we have 

fxsW = E (MO - MoW) |- 

i=k 

Therefore, 

|/x»(*)l < ^^f>«-/^)|| 

i=k 

< (fc-1)! 

i=k 

(k-l)\ t k ( y> t ^_ 

t fc Jfe! I ^/c + l/c + 2'"/c + i 

If i > t then each term in the infinite sum in (|13p is no greater than If i < t, all terms in 

the sum in f|13|) are clearly no greater than 1. Hence 

(ii) k < t: Using the alternative representation © for the Stein solution f XE , this time, we 

get 

(fc-l)!^** (fc-1)! , 

l/x-WI < E " "Wl a * E sr * Sf^ < i 

i=0 i=0 v ' 

as the sequence {jyljeN is increasing for j < t and decreasing for j > t. This completes the proof 
of inequality (jlip . The second statement is now obvious for m <t. On the other hand if m > t 
then it follows from the inequality YliLt+1 1 — 1°& T - ^ 

3.3 Return times distribution 

Now we want to approximate the distribution of r§, that is approximate the function P(t^ < m) 
for all k > 1 and all m G M + . Let A € o"(„4 n ) and denote by W m (x) the number of visits of the 
orbit {T{x),T 2 (x),. . . ,T m (x)} to the set A, i.e. 

rn 

W m (x) = Y J XA{T\X)) 
3=1 

where \A is the characteristic function of the set A that is xa(x) = 1 if x £ A and xa(x) = 
otherwise. Then 

P(4 < m) = 1 - P(r^ > m) = 1 - ¥(W [m] < k) 

where [m] is the integer part of m. Therefore, our problem of approximating the distribution 
of t\ becomes equivalent to approximating the distribution of W m for all m E N. The Poisson 
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parameter t is the expected value of W m (i.e. t = fJ>(W m )). If we put p, = fi(T l A) = fJ,(A) \fi = 
1,2, ... , then 

m m 

t = v(Wm) = V (xaT 1 ) = X> = mfi(A) 

i=l i=l 

i.e. m = [t/fi(A)]. If h = xe with E an arbitrary subset of the positive integers, E C No, then 
we obtain from (I10D 



Sfdfi 



hdfx — I hdfio 

N JNo 



\W m € E) - fx (E)\ 



and in turn, since the Stein Operator S for the Poisson distribution is given by ([6]), we obtain 

\F(W rn E E) - ME)\ = \E (tf{W m + 1) - W m f(W m ))\ V£ C N . 

Notice that the difference |P(W m E E) — /j,o(E)\ above gives exactly the error of the Poisson 
approximation. We hence estimate 



\F(W m e E) - iio(E)\ 



\i=l 

m m 

Y,P l ^f(W m + 1) - ^ftE(/(W m )|/i = 1) 
i=i i=i 

m 

J^Pi (E/ (W m + 1) - E (/(W m )|Ii = 1)) 
1=1 

m / m m 

^2 Pi \J2f( a + 1 ) P ( H/ ™ = a ) - S /( a ) P ( H/ ™ = a l J i = !) 

j=l \a=0 
m m 



a=0 



i=l a=0 

where we put h{x) = xaT 1 (x) the characteristic function of the set T~ l A and 

e a ,i = |P(W m = a) - P(VK m = a + = 1)| . 



(14) 



(15) 



The function / above is the solution of the Stein equation ([7]) that corresponds to the indicator 
function h = xe in the Stein method. In fact bounds on / have been obtained in Corollary (5J 

Now, in view of the new representation for |F(W m E E) — /j,q(E)\ we need to look at the term 
e ai i more closely. If we put W m = W m — xa°T' 1 then the mixing condition yields the following 
estimates on e a ^: 

e a ^ = |P(W m = o)-P(W m = o+l|/i = l)| 

■({W* =a}nr-*A) 



\W m = a) 
\W m = a) 



\W m = a)fx(A) + e' a>i 



< \F(W m = a)-F(W m = a)\ + 
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where e' ai = P {{W l m = a} n T^A) - P(W^ = a)fi(A) {e' a i = if all Ij are independent) and 
£ a = maxi |P({W^ = a} n T~ { A) - ^{W^ = a)fJ,(A)\. The bound on e 0)f has two terms, the first 
of which is 

\F(W m = a) - P(w£ = o)| < P(/i = 1) = n(A). 

The second term, which contains £ a , is the error due to dependence for which we get estimates 
in Proposition [2] below. 

Proposition 2. There exists a positive constant C so that for all n £ N and for all A € a(A n ) 
satisfying r A > § the following estimate holds true 

{{W l m = a} n - ¥(W l m = a)n(A) 

< Cfi(A) inf ( Afi(A) + nfi(A) + n5 A {n) + nd A (r A ) + a(n) + J 
A>o V M^)/ 

w/iere W m = ]T^Li XA ° T j and W l m = Y^l<j<m XA ° T j . 

Proof. Let A « m be a positive integer (the halfwith of the gap) and put for every i £ (0, m] 



i-(A+l) 

wt= E ^ oTi ' 

i-1 
j=i-A 



rn 1 w i 



h+ 



w^ + = E ^ oTi ' 

jf=i+A+l 
i+A 

U^ + = E XAOP, 

W i -If 1 = W { ~ + W i,+ 



with the obvious modifications ifi<Aori>m — A. With these partial sums we distinguish 
between the hits that occur near the i th iteration, namely U?A~ and (7m + , and the hits that occur 
away from the i th iteration, namely Wm~ and Wm + ■ 

The 'gap' of length 2A + 1 allows us to use the mixing property in the terms Wm and its 
size will be determined later by optimising the error term. 

We then have, for < a < m — 1, a E No, that 



\{W m = a + 1} n T~ i A) 



p({C = «}nri) 

E H{W*f = a ± } n {Utf = a°>±} n T"*A) 

a=(a-,a°-,a°-+,a+) 
s.t a|=a 



(intersection of five terms). For < a < m - 1 we have 

P {{WL = ^ n T~M) - P (Wj, = a) fx(A) 
and will estimate the three terms 

r x = p ({w^ = a} n r~U) 

# 2 = P ({Wj^ = a} n T-M. 



<R 1 + R 2 + R 3 



{Wf n = a}nT- t A 
=a>(I f = 1 



TO = a 



/i(A) 
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separately as follows. 

Estimate of R±: Here we show that short returns are rare when conditioned on T~ l A. Observe 
that 



{wi = a}nT~ i A c ({w^ = a}nr-Mju({c/^>o}nr- i A) 
M = «}nrM c ({C = a}nr ! i)u(R>o}nr i i). 

Since U % m > implies that either U^f > or Um > we get 

\¥{{W l m = a} n T~*A) - r({W^ = a}n T~ l A) \ < f>({U l m > 0} n T~*A) < bj + bf 

where 

bT = ¥({U l r ;- > 0} n T~ l A) and bf = P({% + > 0} n T~*A) . 

We now estimate the two terms, b~ and bf, separately as follows: 

(i) Estimate of bf: By Lemma [3] 

bf = p({ic+ > o}nrM) 

= V{U l rk + > 0\li = IMA) 

= P a (ta < A)/i(A) 

< Cfx(A) ((A + n)fi(A) + n(5 A (n) + 5 A {r A )) + a(n)) . 

(ii) Estimate of &r : If U^T > then {£/£f > 0} C Uf=i T-^U and therefore 



We show the following symmetry 



n (t^a n [j t~^a\ = n [t- { a n [j t-^a\ 

For that purpose let Si = Ufc=i <^,fc where J^k = T~ % A n T~^ _fe )j4 and similarly 5j = |j£t=i Ji,k: 
Ji k = T-' l A n T-^A. We now want to show that [x(Si) = fi(Si). We decompose Si into a 
disjoint union as follows: 

A 



where 



Then 



k=l 
fe-l 



= ^i,fc \ [J ^i,fc n Jij. 

3=1 

/ • A \ A 
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Similarly, Si is the disjoint union of = Jj^ \ UjLi Ji,k ^ «^i,j> k = 1, . . . , A. Then 

fc-i fc-i 

3=1 3=1 

where we have used that F~ k Ji^ = and F~ k (Jj^ n Jij) = PI Ji^-j,0 < j < k — 1. 
Therefore, by the invariance of the measure ^(V^fc) = ^(V^fc) and consequently 



k=i 



k=i 



We therefore obtain 

K = i 1 



^ (j t-^-Qa n t~m^ = /i ^ J r^ +fe U n r-uj = P ({?74 + > 0} n t~m) = bj 

Combining (i) and (ii) yields 

i?i < C ((A + n)fi(A) + n(6 A (n) + S A (r A )) + a(n)) . 



Estimate of R^: Now we show that short returns are rare. We proceed similarly to the estimate 
of R\. The set inclusions 

{Wi n = a} C {Wi = a}U{U i m >0} 
{W^ = a} C {W* m = a}U{lP m >0} 

let us estimate 

(Wi = a) -P(W4 = a) < P(t£ > 0) < 2P ^|J{/ i+fc = 1}^ < 2A M (A). 



Hence 



i? 3 < 2A^(A) 5 



Estimate of i?2 : This is the principal term and the speed of mixing now becomes relevant. 
Recall that W'^x) = W^~{x) + W l ^ + {x) and 



R2 



\W'L = a} n T-M) - P (W* m = a) fx(A) 

P({^ ± = a ± }nr-M) 



a=(a ,a+) 
s.t |S|=o 



a=(a ,a+) 
s.t |a|=a 
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For each a = (a , a + ) for which \a\ = a we have 



P({W, 



} n t~m) - p (w; 



= ) m(A) 



< R2,l + R2,2 + -^2,3 



where 



-^2,1 
R2,2 
#2,3 



P ({W^ = a ± } n T~ l A) - P ({W^+ = a+} n T _i A) P = a - ) 

P = a + } n T _i A) - P (W^ + = a + ) //(A) P (W^~ = a") 

P = a + ) P = a") - P (W^ = a ± ) //(A). 



We now bound the three terms separately: 
Bounds for i?2,i= Due to the mixing property 

P ({W^ = a ± }n T~ l A) - P ({W^ + = a + }D T _i A) P (W£ 

we obtain 



) 



<a(A)P (Wjf =a") 



a=(a ,a+) 
s.t |a|=a 



2 p({^ ± = a ± }nr-M)- ^ p({ic + = a + }nr ! i)p(e = a-) 

a=(a _ ,a+) 
s.t |a|=a 

< £ «(A)P(^- = a-) 

a=(a~,a+) 
s.t a|=a 

< a(A). 

Bounds for i?2,2= We have 

#2,2 = p (w£- = o") p ({^ + = « + } n r-*A) - P (W£+ = a+) //(A) 
< a(A)P(W,ji- = a-)M(A) 



and therefore 

a=(a ,a+) 
s.t |o|=a 



= a + } n T-*A) P (Wjf = a") - P = a+) P «f = a") fi(A) 



< Yl a(A)P(W^-=a-)/x(A) 

a=(a~ ,a+) 
s.t |a|=a 

< a(A)/x(A). 
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Bounds for R 2 3 : Here we get 



E 



a=(a ,a+) 
s.t |a|=a 



p(w; 



H(A) < «(2A)m(A). 



Combining the estimates for #2,1, #2,2 and i?2,3 we obtain that 

i? 2 < #2,1 + R2,2 + #2,3 < Ca(A) 

Finally, putting the error terms R\, R2 and R3 together yields 

P({W* = a}n T-M) - nw l m = o)im(A) 

< C mf (/u(A)((A + n )A«(i4) + n(S A (n) + 5 A (r A )) + a(n)) + a(A)) 

for some C € M + independent of A. 
Proof of Theorem [TJ By Proposition [2] 

£ a < C mf (/i(A)((A + + n(^(n) + 5 A (r A )) + a(n)) + a(A)) 

and therefore 

e«,i < KA) + 4^<Cmf ((A + n)fi(A) + n($ A (n) + <k(r A )) + a(n) + 4^r) • 
v{A) A>o \ MA) / 



M (A) 



Let us note that replacing the value t for t* = 
With the new estimates for the error term e Q) j in 
log m = 0(\logu(A)\) and m = [t/n(A)]) 



n{A) results in an error of order 0(fi(A)). 
land we can now Lemma 0] to obtain (as 



fe-1 

i=i 



-t ' 



< Ct(t V 1) mf ( (A + n)n(A) + n(5 A {n) + S A (r A )) + a(n) + 



I 



Proof of Theorem [2} (^ij Polynomial mixing: In the polynomial case where a(k) 
0(k~P) with some f3 > 2 one obtains 



/z(A) 



fe-l 



E^ 



< Ct(t V 1) inf Afi(A) + — 3— r + 

A>0 * nP-l 



A~^ 



\\ogn(A)\ 



where we used that /j,(A^) < e aw for some a > which implies that 6 A (n) < e a 2 + a(^) and 
(also note that a(j) = ©(j - ^ -1 ))). In order to optimise A put A = ^ ^ for some w £ (0, 1). 
Then we get 



g, (^) + + ^) £ + ^ 4- 11(A) 
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The best value for w € (0, 1) is ui = -^j-j- and therefore 



inf (W) 1 -" + + KAf"- 1 ) < 2u(A)^ + J_ < Vn € N 

for some constant C. Since by assumption [ log(^(j4))| < Kn v 

& (m^^ + ^t + M^- 1 ) |iog(^))l <c^^. 



0< 

for some C > and we finally obtain 



k-l 



E^ 



i=l 



< Ci(iVl)- 



1 



0-i- v ■ 



n 



(16) 



fzzj Exponential mixing: In this case when a(k) = O{0 k ) with 9 < 1. Then 



T A > 



fc-i 



e^; 



i=i 



< Ct(t V 1) inf Aa(A) + n6 n + — -- 



|Iog/i(A)|. (17) 



and in order to estimate the RHS let us put A = (1 + e) °f^t^ for some e > 0. Then 



inf Afi(A) + n6 n + 
A>o 1 



|lo g 0| 

log^(A)| 
| log ^| 



and therefore 

mf Ua{A)+n9 n + ^\ \\og a{A)\ < ((1 + e) 1 + n9 n + M (A) £ ) |log/x(^)|. 

Since for any 5 G (0,1) | log x | = O (^) as x — > + we obtain |log/i(A)| < C^jJ^y* for some 
constant C independent of A Hence, as the measure of cylinder sets decay exponentially fast we 
obtain 

inf (A„(A) + + J^j |log/^)| < Ce^ 
for some 7 > 0. Therefore 



fc-i 

i=l 



< Ct(t Vl)e" 7n . 



□ 



4 Return Times on Markov Towers 



4.1 Mixing Properties derived on the Markov Tower 

Let F be a differentiable map on a manifold M and Slo a subset of M. As in |20t [2T] we assume 
that S7q is partitioned into sets Oq i, i = 1,2,... so that there is a return time function it! : Oq — > N 
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which is constant on the partition elements f2o,« and which satisfies that F R maps flo,i bijectively 
to the entire set Qq. Let us put = {(x,j) : x G Qo,i} for 3 = 0, 1, . . . , U(fio,i) — 1< The 



Uoo i i-R(f^0,i)" 
1=1 Ujr=0 



fL- i is called a the Markov tower for the map T. It has the associated 



space 

partition „4 = ■ < j < R(£lo^),i = 1,2,...} which typically is countably infinite. On the 
tower fi we have the map T which for x 6 Oo,j is given by T(x,j) = (x,j + 1) if j < i?(Slo i) ~~ 1 
and T{x,R(fl ,i) - 1) = (F^^O). 

For points x, y G Slo one defines the function s(x, y) as the smallest positive n so that (T R ) n x 
and (T R ) n y lie in distinct sub-partition elements Oo j of Q,q. Two points x and y in belong to 
the same iV-cylinder if and only if they remain together (in the same partition element) for at 
least n iterations of T R , i.e. if s(x, y) > n, where N = X)?=o R(T R y ■ 

The space of Holder continuous functions C 7 consists of all functions ip on SI for which |<^(x) — 



t f(y)\ < C^ s<yX,v \ The norm on C 7 is 



l^loo + Cm, where C« is smallest possible. 



Let v be a finite given 'reference' measure on f2 and assume that the Jacobian JT R with 
respect to the measure v is Holder continuous, that is, there exists a 7 £ (0, 1) so that 



JT R x 



JT R y 



1 



< const^^' 7 ^ 



for all x, y £ S7o,i, i = 1, 2, ... . 

If the return time R is integrable with respect to m then by [21J Theorem 1 there exists 
a T-invariant probability measure fi (SRB measure) on which is absolutely continuous with 
respect to v. Moreover the density function h = ^ = lmin^oo C n \ is Holder continuous, where 
A can be any initial density distribution in C 7 . The transfer operator C : C 7 — > C 7 is defined by 
C(p(x) = Ylx'eT^x jt(J) ' V ^ C 7 , and has the property that v is a fix point of its adjoint, i.e. 
C*v = v. In [21] Theorem 2(11) the /^-convergence was proven: 



\L k \ 



Hl- <v{k)\\MV 



(18) 



where the 'decay function' p(k) = 0(k^ 13 ) if the tail decays polynomially with power /3, that is 
if is(R > j) < const.j - ' 3 . If the return times decay exponentially, i.e. if v{R > j) < const.'!?- 7 for 
some 1? € (0, 1), then there is a # € (0, 1) so that p(k) < const. 

Recall that for each n € N the elements of the nth join A n = \f™=o T~ l A of the partition 
A = {fij j} are called n-cylinders. For each n G N the n-cylinders .4™ form a new partition of the 
space, a refinement of the original partition. The u-algebra T generated by all n-cylinders A £ , 
for all I > 1, is the cr-algebra of the system (SI, J 7 , fj,). 

We will need the following arithmetic lemma to carry estimates for cylinders over to union of 
cylinders. 



Lemma 5. Leta±,a2,--- andb\,b 



2, . . . be positive reals. Then 



h + b 2 + 





a; 


< sup 




i 





Proof. If we put e = supj 
Summation over i yields 



1 — ff then we have by assumption (1 — e)bi < a» < (1 + e)b{ 
(l-e)J>< J><(l + e)J> 



15 



and therefore 



(l-e)<|*£<(l + e) 



which implies the statement. 1 
Lemma 6. There exists a constant Cq so that ||£ n Xyi|| 7 < Cq for all A G a(A n ) and n. 
Proof. We first show that 



log 



JT n (x) 



< C17 



s(T n x,T n y) 



JT n (y) 

for all pairs x, y G A, A G A n and Vn G N. For A G A n and x, y € A we have x, y G f2ij for some 
i < Rj = R(Qo,j)- Put no = Rj — i and then successively = Rj — i + Ylk=X ^ife> wnere the 
are such that T ne x G O 0)ir Clearly T ne x,T ne y G for fc < R jt for all £ for which n e < n. Put 
L = mscK ni < n £ and we get from the distortion property 



log 



JT n (x) 



JT n (y) 



< 



L-l 



JT R ik(T n *(x)) 

fog 5 

S JT R ^{T^{y)) 



L-l 

< C2 X^™ si l " l '■'•'•' / '" t 
fc=0 

{T n L(x),T n L (y)) 

s(T"(x),T™(y)) 



< c 3 7 s 



< 



C17 



for some ci. 

Now, if x, y G fijj for some then let A G *4 n and x',y' G ^4 be so that T n x' = x and 
T n y' = y (for x',y' to exist one needs A C ^ij). Then we obtain 

£ n h XA (y) _ h(y') JT n (x') 
C n h X A{x) h(x') JT n (y') 

which implies by the above estimate and the regularity of the density h that 



, £ n h X A(y) 

log — 



< 



£ n h XA (x 
for which we can also write 



log 



JT n (x') 



+ 



JT n (y' 
C n h XA (y) 



log 



h(x') 



< Cl Y (x ' y) + C47 s(x, - y,) < c 5 ~f s{x ' v \ 



£ n h XA (x) 



< c 6 Y {x,y) G A T ' 



Now any A G cr(A n ) is the disjoint union of some Aj G A n . We now apply Lemma [5] with the 
identification Oj = C n h X A 3 (x), bj = C n h X A j (y)- Since C n h X A = J2j ^h-XAj we obtain 



1 _£h XA (y)_ 
C n h XA (x) 



< c 6 ~f s(x ' y) VA G o-(A n ),Vx,y in some n^,Vn. 
Let us note that in particular (cf. |21| Theorem l(ii) and Sublemma 1) that (as Y^AeA n X a = 1) 



C n l(y) 
C n l(x) 



< ci7 s(x ' y) . 
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Since |£ n l|oo < 1, we now obtain 



\£ n hxA(x) - £ n h XA (y)\ < \C n h XA (y)\ 



C n h X A(x) 



£ n h X A{y) 



<c 67 



s(x,y) 



for some constant Cq. Hence C n hxA 6 C 7 and, moreover, is bounded in the C 7 -norm uniformly 



in A G a(A n ) and n € N. 



I 



We proceed as in the proof of [21] Theorem 3 and put A = C n hxA which is a strictly positive 
function. Then rj = j^rf is a density function as v (A) = i/(£ n /ix^0 = K^AX) = n(A). Moreover 
||A|| 7 is by Lemma [6] bounded by Cq uniformly in n and A G a(A n ). Denote by p(k), k = 1, 2, ... , 
the rate of the decay of correlations which is p(k) = 0{k~@) if the return times tail decays like 
andp(/c) = C(^ fc ) for some $ G (0,1) if the return times tail decays exponentially. We obtain 



fi(AnT- k - n B) - n(A)fi(B) 



u{h XA (XB o T k+n )) - v{hxA)y(h X B) 
fi(A)u( X BC k X) - u{h XB ) 

fj,(A) J X B(jO k X-h) dv 
(C k X - fi(A)h) du. 



(19) 



In particular we thus obtain the estimates using the L -convergence of C k X - h from CEHD yields 



fj,(A n T~ k ~ n B) - ii(A)n{B) < n(A) I X B\£*X-h\du 



< 



< 



KB) 

n(A)ci\\X\\ 1 p(k) 

KB) 
c 2 p(k) 



(20) 



as ||A|| 7 = ^rjyll/lh < ~§A)' ^ e u PP er estimate which only uses boundedness of h and the 
pullbacks of the density A is useful for small k and [i(B). 

Theorem 3. As described above let T be a map on the Markov Tower structure Omega with a 
reference measure v and a return time function R. Let fi be the absolutely continuous invariant 
measure. Let K be a constant. Then for every A n € a(A n ) for which \ log fi(A n )\ < Kn, r^ n > § 
and sup^ n R < ^ , the following results hold true: 

Ifv(R > n) = 0($ n ) (■& G (0, 1) ) or if v(R > n) = 0(n~P), for some /3 > 2, then there exists 
7 > and C-j such that 



K A n) 



k-1 i 

-t l_ 



i=l 



< C 7 (tVl)e 



■yn 



Vf > and Vn G N. 



where r\ is the k th time the system visits A n . 

The condition rA n > § which is to avoid periodic looking points can be weakened to any positive 
fraction of n on the right hand side (and even somewhat more). Similarly, the second condition 
sup^ n i? < § can be weakened by relaxing the upper bound -|. It's purpose is to control short 
returns. 
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4.2 Return times distribution 

Here again we denote by p(k), k = 1, 2, . . . , the rate of the decay of correlations as in ([20]) . that 
is p(k) = 0(k~@) if the return times tail decays like k~@ and p(k) = 0('d k ) for some t? G (0, 1) if 
the return times tail decays exponentially. Let us now prove the main result for Markov towers. 

Theorem 4. Let T: Q — > f2 be a Markov tower as above with a 'reference measure' m and a 
return time function R. Let \i be the absolutely continuous invariant measure for T and p(k), k = 
1,2, ... , the rate of the decay of correlations. 

Let A G o~(A n ) so that rA > § and sup^i? < \. Then for all A (n < A <$C m) and m > t: 

\F(W m G E) - no(E)\ < const, f A M (I) + (2 + t) P ^ ~^ logrn^) , 

where A G a(A n ) is smallest so that A C A and h = min(r J 4,n). 

The condition > § and sup^ R < § is needed to control very periodic like returns which is 
where the Poissonian distribution fails. 

Proof. As before we put W m = Y^j=i XA ° T 3 and = Yli<j<m XA°T J . We have to estimate 
the following quantity: 



|P(W m G E) - fi (E)\ =J2piJ2f( a+ X ) e ^' 



1=1 a =0 

where 



e a ,i = \nW m = a) - F{W m = a + l\Ii = 1)| < \¥(W m = a) - P(W* = a)\ + 
i a = max |P({W^ = a} n IT* A) - P(W^ = a)^{A)\ . 



Clearly 

|P(W m = a) - P(W* = a) | < Wi = 1) = niA) 

which leaves us to estimate £ a and to execute the sum over a where we will use the bounds from 
Lemma [6] for /. 

Let A « m be the halfwith of the 'gap' and for i G (0, m] define as before 
i-(A+l) m 

wt= E ^ oTi ' w ™= E ^ oTj > 

3=1 j=i+A+l 
i-1 i+A 

j=i-A j=i+l 
jji _ jji- i yyi — yyi _ jji — ryi- . Tyi,+ 
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(with the obvious modifications if i < A or i > m — A). For a E [0, m] we have 

p({w m = a + 1} n r~M) = P({w^ = a} n r _i A) 

£ P({^ ± = « ± }n{% ± = a°' ± }nT-A) 

3=(a-,a° <-,o Q >+,o+) 
s.t |a|=a 

where the terms inside the sum are measures of intersections of five sets. Then 

P {{W l m = a}n T~*A) - P {W % m = a) n{A) = R x {a) + R 2 (a) + R 3 (a), 

where 

fli(o) = p({w^ = a}nrM) -p({#4 = a}nr ! i) 

R 2 (a) = P ({W* = a} n T^Aj - P (wjj, = a) P (J< = 1) 
flg(a) = (p(w^ = a) -P(W^ = a))/x(A) 

estimated separately as follows in increasing order of difficulty. 

Estimate of R3: We first show that short returns are rare. The set inclusions 

{Wl l = a} C {T#4 = a}U{[/4>0} 
{W^ = a} C {W* n = a}U{U* m >0} 

let us estimate 

/ A \ 



Wj n = a) -P(Wj 



< P (C4 > 0) < 2P |J {l i+fc = 1} < 2A/x(A). 



\fc=i / 
Hence 

|123(a) I < 2Afi(A) 2 

for every a = 0, . . . , m. 

Estimate of 12i: Here we show that short returns are rare when conditioned on T~ l A. Observe 
that 



{W l m = a}n T~ l A C {{W' m = a}n T~ % A \ U ({U l m > 0} n T~ l A) 
{W^ = a} n T~ l A c {{W l m = a} n T - M) U ({tfj, > 0} n T~M) . 

Since > implies that either C7m + > or Um~ > we get 

|P({W£ = a} n T-*A) - P({W^ = a} n T~*A) \ < f({U l m > 0} n T~M) < 6~ + bj 

where 

67 = F({U%- > 0} n T-M) and 6+ = P({C/4 + > 0} n T~ l A) . 
It was shown in Proposition [2] that = 67. 
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Now let A G cr(A n ) smallest so that A C A, where n = min(n, r^) (by assumption h > n/2). 
Then A is the disjoint union of some Aj G A n . Now put £j = R(Aj) (which is by assumption 
R(Aj) < n) and let us estimate n(Aj). Then Aj[, = C e ^hx^. is in fact given by Aj*.(x 



where y G A,- is such that T^'y = x, and x is any point in Qq. Since by [21] Sublemma 2 

JT^(y) 



log' 



< ci V y, y' G A 



JT^(y') 

for some ci, and as the density /i G C 7 is positive, we get 



log 



< C2 V x, x' G SIq , 



and thus v(Aj) G [^-, C3] — j-— V y £ Aj as : Aj — )• il is one-to-one (C3 > 0) as £j = R(Ah). 
We therefore obtain 

m(a n {t a < A}) < n{A n {r A < A}) = ^ n {t a < A}), 



where by the decay of correlation formula (J20 



KAj n {r A < A}) = (i(A j )F(T A <A)+ f C A '^\ A dv 

J{r A <A} 



< /i(Aj)P(rA < A) + c 4 / A a di> 

J{r A <A} J 

< M(^i)P(TA < A) + c 4 -^>ll({t a < A}) for any y e Aj 



< 



(l + C 2 C 3 C4)^(i j )/i({TA < A}) 



for all j. Since h({ta < A}) < A(j,(A) we obtain 



= »a({ta < A}) < M ^ n ^ } ~ A}) < (lc 2 c 3 c 4 )AMi) 



and thus (05 < 2 + 2C2C3C4) 



for all a G [0, m] 



fli(a) < bf + br < c 5 A/i(A) 



x) + 



Estimate of R2: Here the decay of correlations play a central role. For Wj^(x) = VFm ( 
we obtain as in Proposition [2] 

R 2 (a) = J2 ( P = a± > n T_M ) " P = a± ) V( A )) 

a=(a~ ,a+) 
s.t |a|=a 

where a - + a + = o. As before we split the summands into three separate parts i?2,i> ^2,2> ^2,3 
which we sum over a and bound separately as follows. 
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Bounds for R21' The mixing of sets formula (|19p gives us 

R 2 ,i(a-,a+) = ^({W^ = a ± }nT-' i A) - ^{{W^ + = a + }r\T~ i A) ^(W^- = a~) 



[ (£ A - n \ a --h^(X a -)) du, 



where A a - = C l+n h X x a - , X a - = {W^~ = a~} and Y a + = T A ~ n ({W^ + = a+} n T~ l A). 
According to Lemma [6] ||A a -|L < Cq for any value of a~,i,m and n. Thus, summing over 
a = 0, . . . , m, we obtain 



J2f(a + l)R 2tl (a-,a + ) 



a=0 



< 



E 



a ,a 
m m 



f( a - + a + + 1) f (C A ~ n X a - - hfi(X a - )) dv 

JY„ + 



lib III/ n 

< EE + a+ + l )\ £ a-,a+ / {^~ n K- ~ hn{X a -)) du 

a+=0a-=0 Y a+ 

where £ a - a + IS the sign of the integral f Y (£ A ~ n \ a - — hp,(X a -)) dm. We now split the sum 

a "I" 

over a~,a + in geometric progression and use the bounds on |/| from Lemma H] to obtain 



a=0 



[log 2 2m] [2m2- fc ] 

£ E E " + ' 

k=0 a-,a+=0 
[t] 



a~ + a + + 



+ E £ a-,a+ / (/: A ~ n A a --M^a-)) rf^- 



a -,a+=0 



The first (triple) sum is estimated by I + II, where I is for the terms with e = +1 and 1/ 
contains the terms for which e = — 1. For every we use the fact that a _^| +1 < for 
a~ + a + £ [m2" fc ,m2^^ 1 '). Hence 



[log 2 2m] [2m2" fe ] 

E^E E / (/: A -V-M^-))^ 

a+=0 a-e[0,2m2~ fc l <»+ 



fc=0 



-e[o,: 

S -t £ a-,a+ =1 



log 2 2m] [2m2" fc 

EZ + t \ -< 

fc=0 a+=0 a+ 



^ A ^,a+,l-^(^a+,l)) dv 



(notice that all terms are positive), where 



L 



k,a+,l 



E v = £ i+n xx a+ . 



a-£[0,2m2~ 



and X a + i = U Q -e[o 2m2^ fc ] * s a disjoint union in a(A t+n ). Hence by Lemma H we have 

S -t £ a-.a+ =1 
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\Lk a + 1II7 < Ce f° r an values of a + ,i,n. We thus obtain 



[log 2 2m] 2 + ^ [2m2" fc ] 

7 ^ E E ll^,a+,ill7P(A-n) 

fc=0 a+=0 
[log 2 2m] 

< ^ £ ^ 2m2 ~ V(A - n) 

fc=0 

< C6(2 + t)p(A — n) logm. 

Similarly one estimates the second contribution 77 by putting L k a + 2 = J2 a -e[0 2m2- k ] = 

S.t £ _ _|_= — 1 

C' l+n Xx where X a + 2 is the disjoint union Ua^efo 2m2~ fc l -^a-- We then get as above in estimat- 

S.t £ _ += — 1 

ing the part / (again for every k we estimate |/(a + 1)[ < J^l\ for a - + a + € [m2^,m2 - ^ -1 ')): 



[log 2 2m] [2m2" k ] 
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k=0 



[log 2 2m] 



a+=0 a-e[0,m2- fe ] 



S.t £ _ 4- =—1 



\2m2- 



E2 -\- t sr~ \ 
m2' k ^ 

k=0 a+=0 
[log 2 2m] 2 [2m2" fc ] 



£ A ^ M+ ,2- M*a+, 2 )) dv 



n) 



^ E ^1 E !l L M+,2ll7P( A 
fc=0 a+=0 

< cq{2 + t)p(A — n) logm 

as ||-£fc Q + 2II7 ^ C*6 by Lemma [6l 

In the same way one estimates the second sum above which does not involve a sum over k: 



E / (^""V - M*a~)) ^ < C 6 tp(A 



n) 



a-,a+=0 

These estimates combined yield {cj < 2cq + Cq 



^/(a + l)i? 2)1 i 



a , a 



a=0 



< 07(2 + t)p(A — n) logm. 



Bounds for i?2,2 : Here we get 

^2, 2 (a-,a+) = (v({W^- = a + }nT- l A)- f ,(W^ + = a + )^A))^W^- = a-) 
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where A* = D Jrn hxT~ i a an d therefore we get the following estimate which is independent of the 
value of a: 



^2,2(0 ,a + ) 

a~ +a+=a 



y. {{W%+ = a+} n T~*A) - n(W%+ = a+)fi(A)\ 



a=(a ,a+) 
s.t |a|=a 



< V / |£ A_ri A* - hfi(A)) \du 

< [ \£ A - n \*-hfi(A))\dv 

7 T A-n|J a+{ ^+ =a + } 

< C 6 p(A-n) 

again using the fact that for different a + the sets T A ~ n {Wm + = a + } are disjoint in cr(\J£L i A t '). 
Bounds for R23: We proceed as in the estimates about R2 1. Put 

i?2 l3 (a-,a + ) = (/x = o+) /i (W£" = a-) - /i (W&± = a±)) 

and we obtain in the same way that 



£ /(a-+a+ + l)i? 2 , 3 (a-,^ 
0<a — +a+<m 



< c 7 ^(A) (2 + t)p(2A - n) log m. 
Combining the estimates for i?2,i 5 -^2,2 and i?2,3 we obtain that (eg < 2c 7 + Cg) 



J2f(a + l)R 2 (a) 



a=0 



< cs(2 + i)p(A — n) log m. 



On the other hand, using the estimates on Ri and -R3 together with the Lemma |4] we get 



J2f(a + l)(Ri(a) + R 3 (a)) 



a=0 

Hence 



< 



A(2 M (A)+c 5 M(A))£|/(a+l)| < A(2^(A)+c^(A))(t+(2+t) log j 



a=0 



< 



a=0 



m 



< 2An(A)(p(A) + c 5 /i(A)) ( t + (2 + t) log — ) + c 8 (2 + t)p(A - n) log m 



if m > i, and therefore 



\W m EE)- no(E)\ < c 9 A/i(I) (t + (2 + t) |log/i(A)|) + c 8 (2 + t) p(A . ^ logm 



as m = [t/n(A)] for some C8,cg G R + independent of A. 
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We now look at different decay rates. If we look at a sequence of sets A each in a(A n ) for 
n = 1, 2, . . . which satisfy the conditions of Theorem U] (i.e. ta > §, sup^ i? < §) then we look 
at the two cases when (i) p{A) ~ ra~ 7 for some 7 > and (ii) p{A) ~ p n for some p € (0, 1). 

(i) If the target set A has polynomially decaying measure, p(A) ~ n -7 , then if p(k) = 0{k~P) 

we optimising the error terms requires that the 'gaps' A are of the order n^ +1 where /3 > 3. If 
p[k) = 0{"d k ) is exponentially decaying then the best choice for the 'gaps' is A ~ logn. 

(ii) In the case of Theorem[3l when the return set A has exponentially decaying measure, p(A) ~ 
p n for some p < 1 (e.g. single n-cylinders) then 



p(k) = Oik- 15 ) -> A ~ p'W+V 1 
p(k) = C(#) -> A ~ n 



By Theorem [J] this implies 



ti |9-2 
+ 1 



|P(W meJ B)-/i (S)|< constat Vl)np^ 

I const. (t V l)np2 

where we used that p{A) < const./) 2 . This proves Theorem [3l 
4.3 Examples 

Example 1: In a subshift of finite type with a finite alphabet V = {v\,V2, ■ ■ ■ v m }, an m x m 
transition matrix M with entries in {0, 1} and the sets of all admissible one-sided sequences 
denoted by = |(xo,xi, . . . ) : Xi G V and M XiXi+1 = 1 Mi € No}, if T is the shift operator 
from Tj^j onto SlL which is acting on a sequence by shifting all its symbols to the left, i.e. 
(T(x))i = Xi + \, then a typical n-cylinder is defined as 

C([a ,ai, . . . ,a„-i]) = {(x ,xi, . . . ) 6 : x = a , x x = ax, . . .x n _i = a n _i} . (21) 

Given a partition A of a space the set of n-cylinders A n , for all n 6 N, forms a new partition 
of the space, a refinement of the original partition. 

Example 2: The binomial map 2x mod 1 on (0,1] trivially (R = 1 on all (0,1]) assumes a 
Markov Tower structure with {(0, 1/2], (1/2, 1]} as the partition. 

Example 3: (Gaspard-Wang map) A non trivial example is the following non-uniformly expanding 
1-dimensional parabolic map on the interval with a neutral fixed point. The reference measure 
is the Lebesque measure. Let a € (0, 1). Then we define the map T : [0, 1] — > [0, 1] as follows 

j, + 2^+«, if, e [0,1] 

\2s-l, if x G (1/2,1] 

This map is non-uniformly expanding as its derivative, T'(x) = 1 + 2 a (l + a)x a ~ l equals 1 at 
x = 0. As a S (0, 1), T has an invariant absolutely continuous probability measure p. 
One uses the partition A = {Ai : i = 0, 1, . . . } whose elements are 
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where ao = \ and recursively T(a«) = aj_i for i = 1, 2, . . . . Note that a« ~ z - " [lj. T maps each 
interval A{ bijectively to A^i Aq = (^,1] is mapped bijectively onto the entire interval (0,1]. 
Hence T I+1 maps Ai bijectively to (0, 1]. The return time R is given by R\A{ = i + 1. Moreover 

oo 

\{R>n) = Y J KAi), 

i=n 

where A is the Lebesgue measure. As \{Ai) ~ i~a~^, we see that X(R > n) ~ n~a < oo. By 
Theorem[3]we obtain that for a £ (0, ^), the return times are in the limit Poisson distributed. 
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